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AMPLIFICATION OF NUCLEIC ACIDS 

CROSS-RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Application No. 
08/564,653, filed November 29, 1995. 

TECHNICAL FIELD 

The present invention relates generally to amplification of nucleic acids 
and, in pauticular, to amplification of G+C-rich templates. 

BACKGROUND OF THE INVENTION 

Amplification of nucleic acids has revolutionized molecular biology and 
is now one of the most rapid and versatile methods of producing large quantities of 
DNA for molecular analysis. Despite the power of this technique, uniform 
amplification of all templates, regardless of length or G+C content, remains a challenge. 
Amplification of longer nucleic acid fragments (up to 35 bp) has been accomplished by 
utilizing different en2ymes selected for their thermal stability and editing functions (see 
PCR Methods and Applications 2:51, 257, 1993; Lundberg et al., Gene 108:1, 1991; 
Mattila et aL, Nucleic Acids Res. J 9:4961, 1991; Barnes, Proc. NatL Acad. Sci, USA 
9i;2216, 1994; Cheng et al, Proc. NatL Acad, Sci. USA 97:5695, 1994; Jeffreys et al.. 
Nucleic Acids Res, 7(5:10953, 1988; Krishnan et al.. Nucleic Acids Res. 79:6177, 1991; 
Maga and Richardson, Biotechniques 77:185, 1991; Rychlik et al.. Nucleic Acids Res. 
75:6409, 1990; and Kainz etaL, Anal. Biochem, 202:46, 1992). However, templates 
with high G+C content, regardless of length, are only variably amplifiable or sometimes 
completely imamplifiable even in the presence of reagents that facilitate strand 
separation, stabilize the polymerase, or isostabilize DNA {e.g., dimethyl sulfoxide 
(DMSO), formamide, glycerol, or tetramethylanunonium chloride (TMACl)) 
(Bookstein et al., Nucleic Acids Res. 75:1666, 1990; Sarkar et al.. Nucleic Acids Res. 
75:7465, 1990; Pomp and Medrano, Biotechniques 70:58, 1991; and Hung et al., 
Nucleic Acids Res. 75:4953, 1990). 

Although the average G+C content of the human genome is about 40%, 
individual genes and genetic elements may have G+C content that is higher. For 
example, the human c-myc gene is 60% G+C with regions of greater of 70% G+C. As 
well, the majority of 5' ends of genes and promoters have regions that are G+C rich, and 
some diseases, such as Fragile X Syndrome, result from the in vivo expansion of G+C 



rich triplets (e.g., CGG for Fragile X Syndrome). Detection of these triplets in both 
normal and diseased individuals is difficult. Furthermore, the inability to uniformly 
amplify DNAs with high G-+-C content hinders other methods, such as quantitation of 
transcripts, gene mapping and sequence analysis. 
5 In the present invention, compositions and methods are provided for 

amplifying nucleic acids regardless of their G+C content as well as providing other 
related advantages. 

SUMMARY OF THE INVENTION 
10 This invention generally provides methods and compositions for 

increasing the efficiency of amplification of nucleic acids, especially nucleic acids with 
high G+C content. 

In one aspect, a reaction mixture is prepared by mixing nucleic acid 
templates, one or more primers, nucleotides, a first DNA polymerase and a seconH 
1 5 DNA polymerase that has 3' exonuclease activity, and adding to the reaction mixfiu-e a 
zwitterion and a compound that disrupts base pairing in an amount sufficient to increase 
amplification of an 80% G+C, 500 bp DNA fragment by two-fold, when the zwitterion 
and the compound are present as compared to when the zwitterion and the compound 
are absent. In a related aspect, the reaction mixture is prepared by mixing a 
20 homogeneous nucleic acid template, one or more primers, nucleotides, a first DNA 
polymerase and a second DNA polymerase that has 3' exonuclease activity, and adding 
to the reaction mixture a zwitterion or a compound that disrupts base pairings in an 
amount sufTicient to increase amplification of an 80% G+C, 500 bp DNA fragment by 
two-fold, when the zwitterion or compound are present as compared to when the 
25 zwitterion or compound are absent. 

In one embodiment, the first DNA polymerase lacks a 5' exonuclease 
activity. In a preferred embodiment, the first DNA polymerase is Klentaql DNA 
polymerase and the second DNA polymerase is Pfu DNA polymerase. In other 
embodiments, the DNA polymerase pairs are xTth DNA polymerase and Thermococcus 
30 litoralis DNA polymerase; Taq DNA polymerase and Pyrococcus DNA polymerase; 
Taq DNA polymerase and Pwo DNA polymerase. 

In other embodiments, the zwitterion is selected from the group 
consisting of betaine, monomethyl glycine, dimethyl glycine, and D-camitine. In yet 
another embodiment, the compound that disrupts base pairing is dimethylsulfoxide or 
35 formamide. In a preferred embodiment, the zwitterion is betaine and the compound is 
DMSO. 
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In yet other embodiments, the nucleic acid template is selected fix>m the 
group consisting of genomic DNA, cDNA, plasmid DNA, DNA fragment, and viral 
DNA. 

These and other aspects of the invention will become evident upon 
5 reference to the following detailed description and attached drawing. In addition, 
various references are set forth below which describe in more detail certain procedures 
or compositions. Each of these references are incorporated herein by reference in their 
entirety as if each were individually noted for incorporation. 

10 BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE LISTING 

Figures lA and IB are photomicrographs depicting the electrophoretic 
screening data for the various test fragments and test solutions according to the present 
invention. 

Figure 2 is a photomicrograph depicting the electrophoretic screening 
1 5 data for the amplification of the CGG triplet repeat in Fragile X Syndrome. 

Figure 3 is a photograph of amplification of cDNAs from a small 
number of cells and tissues. M, molecular weight markers; JTP, cDNA from activated 
Jurkat T cells; J 1000, cDNA from 1000 Jurkat cells; J5000, cDNA from 5000 Jurkat 
cells; Earl, cDNA from human inner ear; Ear2, cDNA from human inner ear. 
20 SEQ ID No. 1 is a DNA sequence of the Ml 3 forward primer. 

SEQ ID No. 2 is a DNA sequence of the Ml 3 reverse primer, 
SEQ ID No. 3 is a DNA sequence of MHCclnl, which is derived from 
the MHC gene region. 

SEQ ID No. 4 is a partial DNA sequence of human OCT-Tl cDNA. 
25 SEQ ID No. 5 is a DNA sequence of human OCTl cDNA. 

SEQ ID No. 6 is a sequence of a primer used to amplify the repeat region 
of the Fragile X gene, FMRl. 

SEQ ID No. 7 is a sequence of a primer used to amplify the repeat 
region of the Fragile X gene, FMRl. 
30 SEQ ID No. 8 is a partial DNA sequence of TFR, the human transferrin 

receptor. 

DETAILED DESCRIPTION OF THE INVENTION 

Prior to setting forth the invention, it will be helpful to an understanding 
35 of the invention to define certain terms that are used hereinafter. 
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Nucleotides are depicted according to their recognized abbreviations, 
that is, '*A" refers to adenine, refers to cytosine, "G" refers to guanine, "T" refers to 
thymine, and "N" refers to either A or C or T or G. 

As used herein, "amplification" refers to the increase in the number of 
5 copies of a particular nucleic acid fragment resulting from an enzymatic reaction, such 
as polymerase chain reaction, ligase chain reaction, or the like. 

As used herein, "oligonucleotide" refers to a nucleic acid molecule 
comprising two or more deoxyribonucleotides or ribonucleotides, and preferably more 
than three. An oligonucleotide may be synthesized or produced by amplification or 
10 cloning. 

As used herein, "polymerase chain reaction" or "PGR" refers to a 
particular process of amplification for the exponential amplification of a specific DNA 
fragment by utilizing two oligonucleotide primers that hybridize to opposite strands and 
flank the region of interest in a target DNA. The process consists of a repetitive series 
15 of cycles involving template denaturation, primer annealing, and the extension of the 
annealed primers by a DNA polymerase, such as the thermostable Tag DNA 
polymerase. 

As used herein, a "primer" is an oligonucleotide that anneals to one 
strand of a nucleic acid template and allows elongation of a complementary strand by a 
20 polymerase. 

As used herein, "template" refers to a sequence of nucleotides that can be 
copied by a DNA polymerase from a primer bound to the template. 

As used herein, a "zwitterion" refers to a molecule that is a dipolar ion. 
At the isoelectric pi of the molecule, it will have no net charge; at a pH range between 
25 approximately the and the K^, the molecule bears little net charge. 

I. Amplification of nucleic acid templates 

Amplification using thermostable enzymes generally involves multiple 

cycles of denaturation, aimealing, and synthesis performed at different temperatures. 
30 An amplification reaction typically contains either heterogeneous or homogenous 

template nucleic acid, one or more oligonucleotide primers, nucleotides, various buffers 

and cofactors, and DNA polymerase. To amplify heterogeneous G+C rich templates, a 
' zwitterion and compound that disrupts base pairing are added to the mix; if the template 

is homogeneous (e.g., plasmid clone DNA), either a zwitterion or compound is added. 
35 Protocols vary somewhat in concentrations of reagents, temperatures, 

and incubation times, depending upon the apparatus, length of amplified product. 
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enzyme used, and other parameters. Protocols are readily available {see, for example, 
manufacturer's instructions, Ausubel et al.. Current Protocols in Molecular Biology, 
Greene Publishing, 1995; and Examples herein). Generally, a "hot-start" procedure is 
used, so that all reagents, save one, such as nucleotides, are mixed, and the temperature 
S is raised to at least annealing temperature before the saved reagent is added. 
Amplification reactions may be analyzed by any of a variety of known methods. For 
example, the reaction may be electrophoresed in a polyacrylamide or agarose gel. The 
amplified material may be detected by staining with a dye, such as ethidium bromide, or 
by autoradiography if a radioactive label is incorporated. 

10 

IL Zwitterions and compounds that disrupt base pairing 

As noted above, the present invention provides compositions comprising 
a zwitterion or dipolar ion in combination with a compound that disrupts base pairing to 
enhance and improve amplification of nucleic acids that are G+C rich. Typically, 

1 5 examples of G+C rich regions that show improvement of amplification in the pfesent 
invention contain about 60% G+C over greater than 50 nucleotides, about 80% G+C in 
up to 500 nucleotide region, and 100% G+C in up to about 200 nucleotides. 

A zwitterion contains both an acidic group and a basic group. The acidic 
group is associated and the basic group is dissociated at pH values between about the 

20 pKg (the pH at which the acidic group is half associated and half dissociated) and about 
the pKfe (the pH at which the basic group is half associated and half dissociated). 
Dipolar ions are encountered, for example, whenever a molecule contains both an 
amino group and an acid group, providing that the amine is more basic than the anion of 
the acid. For example, all amino acids exist in a zwitterionic form between about pH 4 

25 and about pH 9. Over this pH range, the amino acids bear little net charge, and at the 
isoelectric point, pi, bear no net charge. 

The zwitterions for use in the present invention must have a b£isic group 
containing at least one alkyl group, which is preferably a methyl group. Preferably the 
basic group is an amine. A general formula for the zwitterion is thus: 

30 

(R)3 N-(CH,)rCOO- 

wherein R is H, CH3, CH2-CH3, or (CH.)j-CH3, at least one R is not H, i is from 2 to 
35 about 6, and j is from 2 to about 6. Preferred zwitterions are betaine (trimethyl 
glycine), D-camitine, dimethyl glycine, and monomethyl glycine. 
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For use in the present invention, the zwitterion should not inhibit DNA 
polymerases. It is preferable that the DNA polymerase retain at least 90% of activity at 
2.5 M zv/itterion concentration. The effect of the zwitterion on nucleic acid molecules 
is a decrease in the Tm of GC base pairs and an increase in the Tm of AT base pairs. 
5 The decrease or increase is preferably at least 2**C, at least 3**C, at least 4**C, and most 
preferably at least S^^C. 

Compounds that disrupt base pairing (e.g., disrupt hydrogen bonding) 
include DMSO, formamide, sodium perchlorate, glyoxyl, and the like. Such 
compounds may be identified by their ability to lower the melting temperature of a 
10 nucleic acid duplex. Such compounds should not inhibit DNA polymerases. It is 
preferable that the DNA polymerase retain at least 90% of activity at a 10% compound 
concentration. 

A zwitterion or a compound that disrupts base pairing cem be tested for^ 
utility in the context of the present invention by demonstration of at least a two fold 

1 5 increase in the amount of amplified product from a homogeneous template of at least 
80% G+C, in the presence of the zwitterion or compound as compared to the absence of 
the zwitterion or compound. When heterogeneous templates (/.e., multiple templates) 
are to be amplified, the zwitterion and compound that disrupts base pairing are tested in 
combination. The zwitterion and compound are added in amounts sufficient to cause an 

20 increase in efficiency of amplification. In general, the zwitterion is added from about 
O.l M to about 3 M, from about 0.5 M to about 4.5 M, or from about 1 M to about 2.5 
M. In general, the compound is added from about 5% to about 15%, or preferably from 
about 5% to about 10%. 

25 III. Two enzyme amplification systems 

As noted above, an improvement in amplification is achieved when 
amplification is performed in a two-enzyme system, • 

Two-enzyme systems are used for amplification of long fragments. In 
general, one of the enzymes has a proofreading activity (3' -> 5' exonuclease) that 
30 corrects nucleotide misincorporations that may otherwise prematurely terminate DNA 
synthesis, and the other enzyme is highly processive. Preferably, one or both of the 
enzymes lack 5' 3' exonuclease activity. Also, preferably, the enzymes are 
thermostable. Examples of combinations of enzymes that are commonly used are 
Klentaql plus Pfu DNA polymerases (1:15 v/v), rTth plus Vent® isolated from 
35 Thermococcus litoralis DNA polymerases (Perkin-Elmer, Foster City, CA); Taq and 
Pyrococcus DNA polymerases (Life Technologies, Gaithersburg, MD); and Taq and 



Pwo DNA polymerases (Boehringer-Mannheim, Indianapolis, FN). Other combinations 
may be used. 

IV. Application of amplification of G+C rich templates 

5 As described herein, the present invention provides compositions and 

methods for uniform amplification of nucleic acid templates regardless of G-*-C content. 
Such amplification allows accurate quantitation, detection of G+C rich regions, 
diagnosis of certain diseases, and improves construction of cDNA libraries, DNA 
sequence analysis, and the like. For example, 5' ends of genes are G+C rich and are 

10 often not amplified well during construction of a cDNA library. Some coding 
sequences are G+C rich and will also be underrepresented in a cDNA or genomic 
library. Without the present invention, if the starting material v/as a cDNA library with 
G+C-rich cDNA present in medium abundance (<0.1%), upon amplification, this G+C- 
rich nucleic acid would be less efficiently amplified and gradually lost. Thus, to find 

1 5 the cDN A in a screen by hybridization, several million clones would likely need to be 
examined. Using the present invention, the G+C rich cDNA should be present in the 
amplified material at the same level as in the starting material. Following amplification 
of cDNA fi-om RNA isolated from a small number of cells, differential display 
amplification can be performed. 

2*0 Moreover, several genetic diseases are caused by a natural expansion of 

a G-+-C-rich triplet. Such diseases include fi-agile X syndrome, spinobulbar muscular 
atrophy, myotonic dystrophy, Huntington's chonea, and spinocerebellar atoxia type I . 

Furthermore, in addition to enhancing amplification, the present 
invention may improve the efficiency of nucleic acid synthesis in general, including 

25 reverse transcription, in vitro transcription, primer extension and the like. 

EXAMPLES 

EXAMPLE 1 

30 AMPLIFICATION OF G+C RjCH TEMPLATES USING TaQ DNA POLYMERASE 

AND BETAINE OR DMSO 

A number of test DNAs are utilized in amplification reactions. Four of 
the test fragments are: 



g 

(i) a 2.7 kb cDNA encoding the octamer binding protein, OCT I 
(GenBank accession No. X13403; Sturm et al.. Genes & Dev. 2:1582, 1988; SEQ ID 
No. 8), which has a 52% CH-C content; 

(ii) a 1.6 kb fragment from the MHC region, MHCclnl, which has a 64% 
5 G+C content (see Shukia et al.. Nucleic Acids Res. /9:4233, 1991; GenBank Accession 

No. L20433; SEQ ID No. 3, which presents nucleotides 320 to 850 of L20433); 

(iii) a 0.8 kb fragment of human OCT-Tl cDNA, which encodes a 
transcription factor expressed in T lymphocytes that binds the octamer sequence, and 
which has an 80% G+C content (see Bhargava et al., Proc. Natl. Acad Sci. USA 

10 97:10260, 1993; SEQ ID NO. 4); and 

(iv) a 0.72 kb fragment of human transferrin receptor, TFR, which has a 
44% G+C content {Nature 577:675, 1984; GenBank Accession No. X01060; SEQ ID 
No. 5, which presents nucleotides 352 to 866 of XO 1 060). 

As each of these fragments are cloned in pBluescript or a pUC plasmid 
15 (MHCclnl). primers for amplification of these fragments are the Ml 3 forward primer" 
(CGCCAGGGTTTTCCCAGTCACGAC ; SEQ. ID No. I) and the Ml 3 reverse primer 
(AGCGGATAACAATTTCACACAGGA ; SEQ ID No. 2), Thus, the sizes of the 
amplified fragments for each of these four templates, with the exception of MHCclnl, 
include 200 bp of multiple cloning sites of pBluescript vector. 
20 The two G-^C-rich templates are unsuccessfully amplified (MHCclnl 

and OCT-Tl) using AmpliTaq® DNA polymerase (Perkin-Elmer Cetus) under standard 
conditions. (See Science 259:487, 1988). The addition of DMSG (10% v/v), betaine 
(up to 2.5 M), glycerol (10% v/v), formamide (10% v/v) and TMACl 
(tetramethylammonium chloride) (O.l mM to 1.0 M) or increasing aimealing and 
25 denaturation temperatures did not result in visible amplification. In addition, betaine 
(>L0 M), formamide (>5%), and TMACl (>100 mM) are inhibitory for amplification of 
the non-G+C-rich template OCTl. 

30 EXAMPLE 2 

AMPLIFICATION OF G+C RJCH TEMPLATES USING A TWO ENZYME DNA POLYMERASE 

System and Betaine or DMSO 

Each of the G+C-rich templates is set up in an amplification reaction 
35 using a two enzyme mixture, with and without the addition of betaine (0.5 to 2.5 M), 
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DMSO (5-10%), formamide (5-10%), glycerol (5-10%), TMACl (0.1 mM to I M), and 
tetramethylammonium acetate (TMAA) (0. 1 mM - 1 .0 M). 

The amplification reaction (50 ^il) is performed in 20 mM Tris-HCl, 
pH 9.0 at 25**C, containing 150 }ig/ml of bovine serum albumin, 16 mM ammonium 
5 sulfate, 2.5 mM magnesium chloride, 200 ^iM of each dNTP, 50 ng each of the Ml 3 
forward primer and the Ml 3 reverse primer, 0.5-1.0 ng of plasmid templates and 0.2-0.4 
|il of Klentaq LA16 (a 1:15 v/v ratio of Klentaql (Ab Peptides Inc., St. Louis, MO), 
which is Taq DNA polymerase lacking the 5' to 3* exonuclease activity, and Pfu DNA 
polymerase (Stratagene, LaJolla, CA); the long and accurate (LA) system described in 

10 Barnes, Proc. Natl Acad. ScL USA 97:2216, 1994). DMSO, TMACl, TMAA, betaine, 
or the combination of DMSO and betaine is added to aliquots of the same mixture 
without dNTPs and overlaid with mineral oil. Amplification is performed in a 
thermocycler instrument using thin-walled tubes (0.5 ml). The tubes are heated at 
95*'C for 15 seconds and amplification is initiated by the addition of dNTPs at 80°e. 

15 The following cycling conditions are used for 25 cycles: denaturation at 94*^0 for I'S" 
seconds, annealing at 50 to 55'^C for 1 minute, and extension at 68°C for 5 minutes. 
Following amplification, 10 |il of the resulting product is loaded onto a 1% agarose gel 
(see Molecular Cloning: A Laboratory Manual by J. Sambrook et ah. Cold Spring 
Harbor, 1989), The amplified DNA is stained with ethidium bromide and the gel is 

20 photographed while UV-irradiated by a transilluminator. 

The results are shown in Figures I A and IB. The individual numbered 
lanes in Figures lA and IB, contain the template and additives listed in the following 
table. 

25 TABLE 1 



Lane i 


Template* j 


Additive j 


Lane | 


Template | 


Additive | 


1 


OCTl 


2.2 M betaine 


10 


all four 


10% DMSO 


2 


OCTl 


10% DMSO 


11 


all four 


1 . 1 M betaine 


3 


TFR 


2.2 M betaine 


12 


all four 


2.2 M betaine 


4 


TFR 


10% DMSO 


13 


all four 


5% DMSO + l.l M betaine 


5 


MHCclnl 


2.2 M betaine 


14 


all four 


5% DMSO + 2.2 M betaine 


6 


MHCclnl 


10% DMSO 


15 


all four 


10% DMSO -H l.l M betaine 


7 


OCT-Tl 


2.2 M betaine 


16 


all four 


10% DMSO + 2.2 M betaine 


8 


OCT-Tl 


10% DMSO 


17 


all four 


7% DMSO + LO M betaine 


9 


all four 


5% DMSO 


18 


all four 


5% DMSO + 1.4 M betaine 
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* "all four" refers to the mixture of the 4 templates, OCTl, OCT-Tl. TFR and MHCclnl. 

As shovm in Figure lA, each of the templates could be amplified in the 
presence of betaine or DMSO. Furthermore, DMSO and betaine do not have a negative 
5 effect on amplification. In contrast, the addition of tetramethylammonium chloride 
(TMACl), which is also a tetraalkylammonium ion, does not have a similar effect as 
that found with betaine. Moreover, TMACl and TMAA (at O.l-l.O M) inhibit 
amplification of OCTl. The addition of betaine and/or DMSO aids in the amplification 
of G+C-rich templates (lanes 5-8, Figure lA), while not having any adverse effect on 

10 the amplification of the other templates (lanes 1-4, Figure 1 A). 

Amplification is also performed on a mixture of the four templates, 
varying the concentrations of DMSO and/or betaine. As can be seen in Figure IB, the 
four templates are not amplified uniformly upon addition of either DMSO or betaine 
alone (lanes 9-12). All the templates were amplified to a varying extent in the presence 

15 of 2.2 M betaine (lane 12), 5% DMSO + 2.2 M betaine (lane 14), or 10% DMSO + 1.1" 
betaine (lane 5). When 7% DMSO with l.O M betaine and 5% DMSO in combination 
with 1.4 M betaine (lanes 17 and 18) were tested, all templates were amplified 
imiformly. Thus, by adding both DMSO and betaine, preferably within a range of 1.0 
M betaine with 6-8% DMSO or 5% DMSO with 1.2-1.8 M betaine, all four templates 

20 are amplified. 

Thus, approximately equimolar amplification of all the templates 
occurred at 1.0 M betaine + 7% DMSO (lane 17), and 5% DMSO + 1.4 M betaine (lane 
18). A combination of 1.0 M betaine with 6-8% DMSO or 5% DMSO with 1.2-1.8 M 
betaine resulted in similar uniform amplifications; accordingly such concentrations are 
25 preferred. Although at a higher concentration of DMSO (12-15%) or betaine (1.8-2.5 
M), all the templates showed variable amplification, approximately equimolar yield of 
individual components of the template mixtures were obtained when DMSO and 
betaine were within the preferred concentration range and were combined with the LA 
system. 
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EXAMPLE 3 

Amplification of Templates using Two Enzyme DNA Polymerase Systems and 

Betaine or DMSO 

Various single and two enzyme systems were used to amplify OCT-Tl 
(80% G+C) or TFR (44% G+C) templates. Amplification reactions were performed as 
above in the presence of 2 M betaine. The results are presented below. 



Table 2 



Enzyme System 


Enzymes 


OCT-Tl 
(80% G+C) 


TFR 
(44% G+C) 


Sensitivity 
to Betaine 


LA- 16 


Klentaql and 
Pfu DNA 
Polymerase 








AmpliTaq® DNA 
Polymerase 


xTaq DNA 
Polymerase 


-/+ 


-f-H- 




XL PGR (Perkin-Elmer) 


iTth and T. litoralis 
DNA Polymerase 


-H-+ 


-M-h 




Elongase (BRL) 


Tag and 

Pyrococcus DNA 
Polymerase 








Expand System 
(Boehringer Mannheim) 


Tag and Pwo DNA 
Polymerase 


-H- 






Vent® DNA Polymerase 
(NE Biolabs) 


T. lit or oils 
Polymerase 








Deep Vent^*^ Polymerase 
(NE Biolabs) 


Pyrococcus 
Polymerase 









As shown above, all enzymes, except Pyrococcus polymerase, amplified 
TFR, a template with average G+C content. However, only the two enzyme systems 
were able to amplify OCT-Tl , a template with high G-hC content. 
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EXAMPLE 4 

Amplification of Templates using Structural Analogues of Betaine 

Various analogues of betaine and two tetraalkylanmionium salts are used 
5 to amplify OCT-Tl or TFR templates. Amplification reactions are performed as above 
in the presence of either 2 M analogue or at lea$t 0.15 M salt. Results are presented in 
the Table below. 



Table 3 



Compound 


OCT-Tl 


TFR 


Monomethyl glycine (2 M) 






Dimethyl glycine (2 M) 


+ 




Glycine betaine (2 M) 


-HH- 




D-camitine (2 M) 






TMAC (>0.15 M) 






TMAA(>0.15 M) 
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The structure of the compounds is shown below. 

Table 4 



Glycine betaine 


(CH3)3N— CH2-COO" 


D-C£imitine 


OH 

1 + 
OOC CHj— CH— CH2— N— (CH3). 


Monomethyl glycine 


+ 

CH3— H2N -CH2— COO' 


Dimethyl glycine 


(CH3)2— NH— CH2 COO' 


TMACl 


(CH3),— N-cr 


TMAA 


+ 

(CH3)4 — N — CHaCOO' 



As shown in the table, TMACl and TMAA are unable to facilitate 
15 amplification of either template and most likely inhibit the enzyme. However, the 
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zwitterions facilitated amplification of both templates, the quartemary ammonium ions 
working best. 

EXAMPLE 5 

5 Amplification of G+C Rich Repeats of the Fragile X Gene 

Amplification of the G+C-rich repeat expansion involved in Fragile X 
Syndrome is performed. Fragile X Syndrome is one of the most conunon causes of 
mental retardation affecting 1 in 1,500 males, and 1 in 2,500 females. The fragile X site 

10 has been mapped to Xq27.3, and shown to be due to the expansion of CGG triplet 
repeats in the 5'-end of the FMRl gene. Normal subjects have 2-50 CGG repeats with an 
average at 29 repeats, those with premutation have about 50-200 repeats, and 
individuals with disease have more than 200 repeats (i.e., a full mutation) (see 
Nussbaum and Ledbetter, Ann. Rev. Genetics 20:109, 1986; Verkerk et al.. Cell 65:905, 

15 1991; and Kremer et al.. Science 2J2:1711, 1991). Diagnostic confirmation of this 
disease has primarily been accomplished by cytogenetic analysis of the cells and 
Southern blotting of the DNA firom these patients (see Sutherland, Am. J. Hum. GeneL 
i/:125, 1979; and Rousseau et al., N. Engi J. Med 525.1673, 1991). The sequence of 
the fragile X CGG repeat region may be found in GenBank accession No. X61378.gb- 

20 pr, and encompasses nucleotides 2599 to 2868. 

The methods described herein are used to amplify the G+C-rich fragile X 
CGG triplet repeat region (85% G+C) directly fi-om genomic DNA of a cell line with a 
normal FMRl gene, JV, (29 CGG repeat), genomic DNA from 2 males with 
premutations (74 and 119 CGG repeats, respectively), and genomic DNA from 2 

25 patients carrying full mutations (>1000 CGG repeats). The sizes of the CGG repeats 
were obtained by Southern blotting and hybridization with specific probes. 

Amplification is performed as described above in the presence of 10% 
DMSO + 1.1 M betaine or 5% DMSO + 2.0 M betaine. The sequence of primers used 
are: 

30 

primer c: GCTCAGCTCC GTTTCGGTTT CACTTCC6GT (SEQ ID No. 6) 
primer f: AGCCCCGCAC HCCACCACC AGCTCCTCCA (SEQ ID No. 7) 

35 Either 10 or 100 ng of genomic DNA from each sample is mixed in 

20 mM Tris-Cl (pH 9.0) containing 150 |xg/ml bovine serum albumin, 16 mM 
ammonium sulfate, 2.5 mM magnesium chloride, and 50 ng of each primer. All the 
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components, except dNTP, are denatured at for 1 5 sec, and dNTPs are added at 

SO^'C. Amplification proceeded using the following cycling conditions for 40 cycles: 
denaturation at 94*'C for 1 5 sec, annealing at 65°C for 1 minute, and extension at 68*C 
for 5 min (or annealing and extension was alternatively done at 68^C for 5 mln). 

5 Following amplification, 10 |il of the reaction was loaded on a 2% agarose gel ahd the 
bands visualized by ethidium bromide staining. 

The results are depicted in Figure 2, in which lane I is JY DNA (10 ng); 
lane 2 is JY DNA (100 ng); lane 3 is 74 CGG repeat containing DNA; lane 4 is 1 19 
CGG repeats containing DNA; lane 5 is DNA containing heterozygous full mutation in 

10 a female patient; and lane 6 is a full mutation from a male DNA sample. 

An approximately 300 bp PGR product is observed for JY (Figure 2, 
lanes 1 and 2), and the expected size fragments for 74 repeat (440 bp) and 119 repeats 
(575 bp) are observed as well (Figure 2, lanes 3 and 4). However, no specific band is 
observed for the DNA from a male patient carrying a fiill mutation (Figure 2, lane 5), 

15 and only a normal band (300 bp) is observed for DNA from a female patient carrying a'' 
full mutation as well as a normal gene (Figure 2, lane 6). Thus, the normal sized repeat 
(with an average repeat number of 29) and remutation expansion can be visibly 
amplified, whereas the fill! mutation (>200 repeat) is not visibly amplified. 

20 EXAMPLE 6 

Amplification of cDNAs using Betaine or DMSO 

First strand cDNAs synthesized from total RNA isolated from a small 
number of cells and tissues are amplified. Either Jurkat T cells, induced for 4 hrs with 

25 50 nM phorbol l2-myristate l3-acetate and 2 jig/ml phytohemagglutinin, or human 
inner ear samples, obtained during a surgical procedure, were used. Total RNA was 
isolated, and double-stranded cDNA was made. An adaptor containing a Notl site was 
ligated to the cDNAs. Amplification reactions are performed as described above using 
Klentaq LA 16, using the adaptor as primer in the presence of 1.5 M betaine and 7% 

30 DMSO, except that each cycle consisted of denaturation at 95''C for 15 sec, annealing at 
SS^'C for 1 min, and extension at eS'^C for 5 min. A second round of amplification was 
performed on 1 ^\ of reaction mix using the cycle conditions: denaturation at 95°C for 
15 sec, annealing at SO'^C for 1 min, and extension at 68°C for 5 min. 

As shown in Figure 3, uniform amplification occurred using cDNA from 

35 tissues (Earl and Ear2) as well as from cell lines (JTP, J 1000, J5000). Moreover, even 
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when small numbers of cells are used to extract total RN A, amplification is substantial 
(lanes J 1000, J5000). 

It will be appreciated that, although specific embodiments of the 
invention have been described herein for purposes of illustration, various modifications 
may be made without departing from the spirit and scope of the invention. 
Accordingly, the invention is not limited except as by the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(1) APPLICANTS: Sherman M Weissman 
Namadev Baskaran 

(n) TITLE OF INVENTION: AMPLIFICATION OF NUCLEIC ACIDS 
(ni) NUMBER OF SEQUENCES: 8 

(IV) CORRESPONDENCE ADDRESS. 

(A) ADDRESSEE SEED and BERRY LLP 

(B) STREET: 701 Fifth Avenue. 6300 Columbia Center 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY- USA 

(F) ZIP. 98104 

(V) COMPUTER READABLE FORM. 

(A) MEDIUM TYPE- Floppy disk 

(B) COMPUTER- IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(0) SOFTWARE: Patentin Release #1 0. Version #1.30 

(VI) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 29-N0V-1996 

(C) CLASSIFICATION 

(vm) ATTORNEY/AGENT INFORMATION: 

(A) NAME- Nottenburg Ph D. . Carol 

(B) REGISTRATION NUMBER: 39.317 

(C) REFERENCE/DOCKET NUMBER: 390036. 402C1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX- (206) 682-6031 



(2) INFORMATION FOR SEQ ID NO.l. 

(1) SEQUENCE CHARACTERISTICS 

(A) LENGTH- 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY- linear 



(XI) SEQUENCE DESCRIPTION. SEQ ID N0:1: 
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CGCCAGGGTT TTCCCAGTCA CGAC 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
AGCGGATAAC AATTTCACAC AGGA 
(2) INFORMATION FOR SEQ ID N0:3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 



GGATCCGGCC 


CCACGGA6GT 


CCCCATCTCC 


CTCAAGATTC TCAGAHCAT CCCCAATGAG 


60 


TG6T6TAGCC 


CCTACAGGGG 


TGTCAGCCCC 


CCTCATCACC AACAGTGACA 6TGACAGAGG 


120 


CTGGAGATGA 


GGGGCCA6CA 


G6CTCCAGGG 


AGTCGGGG6T GGCCnGGGC AGGGTTTCTT 


180 


CACTACGAGG 


GGTGTTCCCC 


AAAGAGCCAT 


GAACTGTAGA 6GAAGAGAAA AAGTTCAGAG 


240 


CTAAGGGCTC 


AGGAGATCCT 


GTGTATTTAG 


GGAAGGTGAC GGTCCAATTG GGGCCCGTTT 


300 


TA6CTGCACT 


CACCTCTCTC 


GGTGGCTCCT 


CTGGTTTCCT TCTCCAGCAG CTCCCCCATC 


360 


TCAGCGGGGG 


CCATCCCCCT 


GGGAGGGGAG 


ACAAGGGACA GGAGGGCTGG TCAGCCCAGT 


420 


AGAGAGTTGG 


GGGGTCCAGG 


ATGCCTGGGC 


CCTGGGAAGA GAGAGTAGGC TCCGGGGCCT 


480 


ACCTCTTCCT 


CTGGCCCTTC 


CGCGGCCTCG 


GCTGCCCGGA GCCGCACAAC CCTCCCCGGG 


540 


CCGCATAATC 


CCTCCTTGAT 


GACCCTCCCT 


CTCGGTAGTA CCCGCACTCT GGGGCCGAGA 


600 


GAAGAGGA6G 


GGGCACGGAC 


TCTTGGGGGG 


GGCCTCCGAG CCCGGCCCCG CCCCTCTCCC 


660 


CG6CTGCACG 


CGCCGATACC 


CTTTGTACCC 


AGGCGCGGGA CCCGGACAAT CCTCAGATCC 


720 
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TCCAGCACCC GCTGCCCCCC AGCCCGGTGG ACGGCCCCTC GTGCCCCTCA CGC6TGCTCC 780 

TGGGGCCCCG GCGCCCGTCG CCCAGTGCGG GCAGGCCGGC GGCTGCACGC GCGCCTCCGT 840 

GCCCACTCCC CCCACCTCCC ACACCCTGGT CCCCTCATCC GCCCCCGGTG CTGGCCCCCT 900 

GGATTGCTGC AAGTCCCGCC CGGCCCCCGG CCCCGHGCA CCCCCGGAGC ATTGCACGGC 960 

GCncCCCCG GGGGCGCGCG CGG6CATGCA CCCGCCTCTC CCCCTCCCTT CCGCACCTCG 1020 

6CGGCCGCCG CC6CTGCAGC TCCCGCCGCC GCCGCCATCG CGCTTGCGCT GGG6GCCGAG 1080 

CCGGCGCGCG GCCGCCCCGG GTCACGTGGG C6AG6GAG6G AGGGCGAGGA GGAGCCTTAA 1140 

AGGAGCCGCT ACATGCTTTT T6GCCATTTT CCCCTGA6A6 CGGCCTCG6A 6AT66CTGTG 1200 

ACTGTCCTAA GCTGGGA6CT GCAAGGGAGA ATTCCTGTCA TTCCTG6CCT CAGTTCTGCA 1260 

GGGACCGAGG GCGAGACACG CCTGG6CCCA GGTGTGGCGT CTCTGTCCCC ATCTGGmT 1320 

AGGTAACAAG CGGACGHCT GAACTTCTCG GCTCTCGGCA 6CGGCTGTAT TTCCTCTGGC 1380 . . 

CTGGnGGGC TTTTCCCGCC TCTGGTTGCT TTTCTGCCn TCTAGTTTTT GGGTTACCAG 1440 

ATAGAAGGCT TGGCCTCAGT TTTGGCCTCG CCTTTTTGCT CTTTCTAACG AGCACGAAGG 1500 

GGCGATAGGG ACGCGGAGGA CACCTTTATT CnGGCTGGT TCTAGCATGC TGCnCATGT 1560 

CCCCTGGAGC AGCGTGCCCT TCTGAAAACC TGTGGCTAAA TGTCTCTTCT GTTTATATCT 1620 

G6CGT6TTAC ACCTTCACAC GCACTA6GAT CC 1652 

(2) INFORMATION FOR SEQ ID NO -.4: 

(1) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 530 base pairs 
(8) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

CGAG6CCATC CGGCGGGCCT GCCTGCCCAC GCCGCCGCTG CAGAGCAACC TCTTCGCCAG 60 

CCTGGACGAG ACGCTGCTGG CGCGGGCCGA' GGCGCTGGCG GCCGTGGACA TCGCCGTGTC 120 

CCA6G6CAAG AGCCATCCn TCAAGCCGGA CGCCACGTAC CACACGATGA ACAGCGTGCC 180 

6TGCACGTCC ACTTCCAC6G TGCCTCTG6C 'GCACCACCAC CACCACCACC ACCACCACCA 240 
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GGCGCTCGAA CCCGGCGATC TGCTGGACCA CATCTCCTCG CCGTCGCTCG CGCTCAT6GC 
CGGCGCGGGC GGCGCGGGCG CGGCGGCCGG CGGCGGCGGC GCCCACGACG GCCCGGGGGG 
CGGTGGCGGC CCGGGC6GCG GCGGCGGCCC GGGCGGCGGC GGCCCCGGGG GAGGCGGCGG 
T6GCGGCCCG GGGGGCGGCG GCGGCGGCCC G66CGGCGGG CTCCTGGGCG GCTCCGCGCA 
CCCTCACCCG CATATGCACA GCCTGGGCCA CCTGTCGCAC CCC6CGGCGG 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
ATGGCGATAA CAGTCATGTG GAGATGAAAC HGCTGTAGA T6AAGAAGAA AATGCTGACA 
ATAACACAAA GGCCAATGTC ACAAAACCAA AAAGGTGTAG T6GAAGTATC TGCTATGGGA 
CTATTGCTGT GATCGTCTTT TTCTTGAHG GATTTATGAT TGGCTACTTG GGCTATTGTA 
AAGGGGTAGA ACCAAAAACT GAGTGTGAGA 6ACT6GCAGG AACCGAGTCT CCAGTGAGGG 
AGGA6CCAG6 AGAGGACHC CCTGCAGCAC GTCGCTTATA HGGGATGAC CTGAAGA6AA 
AGHGTCGGA GAAACTGGAC AGCACAGACT TCACCAGCAC CATCAAGCTG CTGAATGAAA 
ATTCATATGT CCCTCGTGAG GCTGGATCTC AAAAAGATGA AAATCTT6CG TTGTATGTTG 
AAAATCAATT TC6TGAATTT AAACTCA6CA AAGTCTGGCG TGATCAACAT TTTGTTAAGA 
TTCAGGTCAA AGACAGCGCT CAAAACTCGG TGATC 
(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY, linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
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GCTCAGCTCC GlTrCGGTTT CACTTCCGGT 
(2) INFORMATION FOR SEQ ID N0:7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

AGCCCCGCAC TTCCACCACC AGCTCCTCCA 

(2) INFORMATION FOR SEQ ID N0:8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2584 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEONESS: Single 
(0) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
GA6GAGCAGC GAGTCAAGAT GAGAGHCAG CCGCGGCGGC AGCAGCAGCA GACTCAAGAA 
TGAACAATCC GTCAGAAACC AGTAAACCAT CTATGGAGAG TGGAGAT6GC AACACAGGCA 
CACAAACCAA TGGTCTGGAC TTTCAGAAGC AGCCTGTGCC TGTAGGA6GA GCAATCTCAA 
CAGCCCAGGC GCAGGCTTTC CTTGGACATC TCCATCAGGT CCAACTCGCT GGAACAAGTT 
TACAGGCTGC T6CTCAGTCT HAAATGTAC AGTCTAAATC TAATGAAGAA TCGGGGGAH 
CGCAGCAGCC AAGCCAGCCT TCCCAGCAGC CTTCAGTGCA GGCAGCCAH CCCCAGACCC 
AGCTTATGCT A6CTGGAGGA CAGATAACTG GGCTTACTTT GACGCCTGCC CAGCAACAGT 
TACTACTCCA GCAGGCACAG GCACAGGCAC AGCTGCTGGC TGCTGCA6TG CAGCAGCACT 
CCGCCAGCCA GCAGCACAGT GCTGCTGGAG CCACCATCTC CGCCTCTGCT GCCACGCCCA 
TGACGCAGAT CCCCCTGTCT CAGCCCATAC AGATCGCACA GGATCTTCAA CAACTGCAAC 
AGCTTCAACA GCAGAATCTC AACCTGCAAC AGTTTGTGn GGTGCATCCA ACCACCAATT 



.•^ .T^ ^'Pjf -7^ ?frr ■r:;'; _ ri?:7r!f 



TGCAGCCAGC 


GCAGITTATC 


ATCTCACAGA 


CGCCCCAGGG 


CCAGCA6GGT 


CTCCT6CAAG 


720 


CGCAAAATCT 


TCAAACGCAA 


CTACCTCAGC 


AAAGCCAAGC 


CAACCTCCTA 


CAGTCGCAGC 


780 


CAAGCATCAC 


CCTCACCTCC 


CAGCCAGCAA 


CCCCAACACG 


CACAATAGCA 


GCAACCCCAA 


840 


TTCAGACACT 


TCCACAGAGC 


CAGTCAACAC 


CAAAGCGAAT 


TGATACTCCC 


AGCTTGGAGG 


900 


AGCCCAGTGA 


CCTTGAGGAG 


CTTGAGCAGT 


TTGCCAAGAC 


CTTCAAACAA 


A6ACGAATCA 


960 


AACTTGGATT 


CACTCAGGGT 


GATGTTGGGC 


TCGCTATGGG 


GAAACTATAT 


GGAAATGACT 


1020 


TCAGCCAAAC 


TACCATCTCT 


CGATTTGAAG 


CCTTGAACCT 


CAGCTTTAAG 


AACATGTGCA 


1080 


AGHGAAGCC 


ACI 1 1 lAGAG 


AAGTGGCTAA 


ATGATGCAGA 


GAACCTCTCA 


TCTGATTCGT 


1140 


CCCTCTCCAG 


CCCAAGTGCC 


CTGAAHCTC 


CAGGAATTGA 


GG6CTT6AGC 


CGTAGGAGGA 


1200 


AGAAACGCAC 


CAGCATAGAG 


ACCAACATCC 


GTGTGGCCTT 


AGAGAA6AGT 


nCTTGGAGA 


1260 


ATCAAAAGCC 


TACCTCGGAA 


GAGATCACTA 


TGATTGCTGA 


TCAGCTCAAT 


ATGGAAAAAG 


1320 


AGGTGATTCG 


TGTTTGGTTC 


TGTAACCGCC 


GCCAGAAAGA 


aaaaa6aatc 


AACCCACCAA 


1380 


GCAGTGGTGG 


GACCAGCAGC 


TCACCTATTA 


AAGCAAI 1 1 1 


CCCCAGCCCA 


ACTTCACTGG 


1440 


TGGCGACCAC 


ACCAAGCCTT 


GTGACTAGCA 


GTGCAGCAAC 


TACCCTCACA 


GTCAGCCCTG 


1500 


TCCTCCCTCT 


GACCAGTGCT 


GCTGTGACGA 


ATCTTTCAGT 


TACAGGCACT 


TCAGACACCA 


1560 


CCTCCAACAA 


CACAGCAACC 


GTGATTTCCA 


CAGCGCCTCC 


agcttcctca 


GCAGTCACGT 


1620 


CCCCCTCTCT 


GAGTCCCTCC 


CCTTCTGCCT 


CA6CCTCCAC 


CTCCGAGGCA 


TCCAGTGCCA 


1680 


GTGAGACCAG 


CACAACACAG 


ACCACCTCCA 


CTCCTTTGTC 


CTCCCCTCTT 


6GGACCAGCC 


1740 


AGGTGATGGT 


6ACAGCATCA 


GGTTTGCAAA 


CAGCAGCAGC 


TGCTGCCCTT 


CAAGGAGCTG 


1800 


CACAGTTGCC 


AGCAAATGCC 


AGTCTTGCTG 


CCATGGCAGC 


TGCTGCAGGA 


CTAAACCCAA 


1860 


GCCTGATGGC 


ACCCTCACAG 


TTTGCGGCTG 


GAGGTGCCn 


ACTCAGTCTG 


AATCCAGGGA 


1920 


CCCTGAGCGG 


TGCTCTCAGC 


CCAGCTCTAA 


TGAGCAACAG 


TACACTGGCA 


ACTATTCAAG 


1980 


CTCHGCTTC 


TGGTGGCTCT 


CTTCCAATAA 


CATCACTT6A 


TGCAACTGGG 


AACCTGGTAT 


2040 


TTGCCAATGC 


GGGAGGAGCC 


CCCAACATC6 


TGACTGCCCC 


TCTGTTCCTG 


AACCCTCAGA 


2100 


ACCTCTCTCT 


GCTCACCAGC 


AACCCTGTTA 


GCnCGTCTC 


TGCCGCCGCA 


GCATCTGCAG 


2160 


GGAACTCTGC 


ACCTGTAGCC 


AGCCTTCACG 


CCACCTCCAC 


CTCTGCTGAG 


TCCATCCAGA 


2220 


ACTCTCTCn 


CACAGTGGCC 


TCTGCCAGCG 


GGGCTGCGTC 


CACCACCACC 


ACCGCCTCCA 


2280 


AG6CACAGTG 


AGCTGGGCAG 


AGCTGGGCTG 


CCAGAAGCCT 


nnCACTCT 


GCAGTGTGAT 


2340 
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TGGACTGCCA GCCAGGTTAA TAAACTGAAA AATGTGATTG GCHCCTCTC 6CCGTGTTGT 2400 

GAGGGCAAAG GA6AGAAGGG AGAAAAAAAA AAAAAAAACC ACACACACCC ATACACAATA 2460 

TACCAGAAAA G6AAGGAAGG AT66A6AC6G AACATTTGCC TAATTTGTAA TAAAACACTG 2520 

TCTTTTCAGG GTTGCTTCAT GGGTTGGAGG ACTTTCTAAC CAAAAAHAA AAAAAAAAAA 2580 

AAAA 2584 



